A Study in Entire Chromosomes of Violations of the Intra-strand Parity of Complementary Nucleotides (Chargaff's Second Parity Rule)
نویسندگان
چکیده
Chargaff's rule of intra-strand parity (ISP) between complementary mono/oligonucleotides in chromosomes is well established in the scientific literature. Although a large numbers of papers have been published citing works and discussions on ISP in the genomic era, scientists are yet to find all the factors responsible for such a universal phenomenon in the chromosomes. In the present work, we have tried to address the issue from a new perspective, which is a parallel feature to ISP. The compositional abundance values of mono/oligonucleotides were determined in all non-overlapping sub-chromosomal regions of specific size. Also the frequency distributions of the mono/oligonucleotides among the regions were compared using the Kolmogorov-Smirnov test. Interestingly, the frequency distributions between the complementary mono/oligonucleotides revealed statistical similarity, which we named as intra-strand frequency distribution parity (ISFDP). ISFDP was observed as a general feature in chromosomes of bacteria, archaea and eukaryotes. Violation of ISFDP was also observed in several chromosomes. Chromosomes of different strains belonging a species in bacteria/archaea (Haemophilus influenza, Xylella fastidiosa etc.) and chromosomes of a eukaryote are found to be different among each other with respect to ISFDP violation. ISFDP correlates weakly with ISP in chromosomes suggesting that the latter one is not entirely responsible for the former. Asymmetry of replication topography and composition of forward-encoded sequences between the strands in chromosomes are found to be insufficient to explain the ISFDP feature in all chromosomes. This suggests that multiple factors in chromosomes are responsible for establishing ISFDP.
منابع مشابه
A Proposed Solution to the Historic Puzzle of Chargaff's Second Parity Rule
Chargaff ’s first parity rule for the contents of the four nucleotides in DNA is easily understood based on the double-stranded DNA structure. However, the second parity rule, based on similar nucleotide relationships in singlestranded DNA, has been a puzzle in molecular biology, because it is impossible to imagine how pairs of G and C, and A and T are formed in the single DNA strand. In the pr...
متن کاملA model capturing novel strand symmetries in bacterial DNA.
Chargaff's second parity rule for short oligonucleotides states that the frequency of any short nucleotide sequence on a strand is approximately equal to the frequency of its reverse complement on the same strand. Recent studies have shown that, with the exception of organellar DNA, this parity rule generally holds for double-stranded DNA genomes and fails to hold for single-stranded genomes. W...
متن کاملAnalysis of single-strand exceptional word symmetry in the human genome: new measures.
Some previous studies suggest the extension of Chargaff's second rule (the phenomenon of symmetry in a single DNA strand) to long DNA words. However, in random sequences generated under an independent symbol model where complementary nucleotides have equal occurrence probabilities, we expect the phenomenon of symmetry to hold for any word length. In this work, we develop new statistical methods...
متن کاملMismatch Repair Error Implies Chargaff's Second Parity Rule
Chargaff’s second parity rule (PR2) holds empirically for most types of DNA that along single strands of DNA the base contents are equal for complimentary bases, A = T,G = C. A Markov chain model is constructed to track the evolution of any single base position on a given single strand of DNA whose organism is equipped with the process of mismatch repair. Under the key assumptions that the mism...
متن کاملAsymptotically increasing compliance of genomes with Chargaff's second parity rules through inversions and inverted transpositions.
Chargaff's second parity rules for mononucleotides and oligonucleotides (CIImono and CIIoligo rules) state that a sufficiently long (> 100 kb) strand of genomic DNA that contains N copies of a mono- or oligonucleotide, also contains N copies of its reverse complementary mono- or oligonucleotide on the same strand. There is very strong support in the literature for the validity of the rules in c...
متن کامل